Structure and Term Prediction for Mathematical Text

نویسندگان

  • Oleksandr Polozov
  • Sumit Gulwani
  • Sriram Rajamani
چکیده

Mathematical text is too cumbersome to write because of the need to encode a tree structure in a left to right linear order. This paper defines two novel problems, namely structure prediction from unstructured representation and sequence prediction within a mathematical session, to help address mathematical text entry. The effectiveness of our approach relies on the fact that normal mathematical text is highly symmetric. Our solution to the structure prediction problem involves defining a ranking measure that captures symmetry of a mathematical term, and an algorithm for efficiently finding the structure with the highest rank. Our solution to the sequence prediction problem involves defining a domain-specific language for term transformations, and an inductive synthesis algorithm that can learn the likely transformation from the first couple of sequence elements. Our tool is able to predict the correct structure in 63% of the cases, and save more then half of sequence typing time in 52% of the cases on our benchmark collection. We argue that such algorithms are important components of human-computer interfaces for inputting mathematical text, be it through speech, keyboard, touch or multimodal interfaces.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Document Weighted Approach for Gender and Age Prediction Based on Term Weight Measure

Author profiling is a text classification technique, which is used to predict the profiles of unknown text by analyzing their writing styles. Author profiles are the characteristics of the authors like gender, age, nativity language, country and educational background. The existing approaches for Author Profiling suffered from problems like high dimensionality of features and fail to capture th...

متن کامل

Novel Atom-Type-Based Topological Descriptors for Simultaneous Prediction of Gas Chromatographic Retention Indices of Saturated Alcohols on Different Stationary Phases

In this work, novel atom-type-based topological indices, named AT indices, were presented as descriptors to encode structural information of a molecule at the atomic level. The descriptors were successfully used for simultaneous quantitative structure-retention relationship (QSRR) modeling of saturated alcohols on different stationary phases (SE-30, OV-3, OV-7, OV-11, OV-17 and OV-25). At first...

متن کامل

Quantitative structure activity relationship study of inhibitory activities of 5-lipoxygenase and design new compounds by different chemometrics methods

A quantitative structure-activity relationship (QSAR) study was conducted for the prediction of inhibitory activity of 1-phenyl[2H]-tetrahydro-triazine-3-one analogues as inhibitors of 5-Lipoxygenase. The inhibitory activities of the 1-phenyl[2H]-tetrahydro-triazine-3-one analogues modeled as a function of molecular structures using chemometrics methods such as multiple linear regression (MLR) ...

متن کامل

PreRkTAG: Prediction of RNA Knotted Structures Using Tree Adjoining Grammars

Background: RNA molecules play many important regulatory, catalytic and structural <span style="font-variant: normal; font-style: norma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014